Difference detection in LC-MS data for protein biomarker discovery
Abstract
MOTIVATION: There is a pressing need for improved proteomic screening methods allowing for earlier diagnosis of disease, systematic monitoring of physiological responses and the uncovering of fundamental mechanisms of drug action. The combined platform of LC-MS (Liquid-Chromatography-Mass-Spectrometry) has shown promise in moving toward a solution in these areas. In this paper we present a technique for discovering differences in protein signal between two classes of samples of LC-MS serum proteomic data without use of tandem mass spectrometry, gels or labeling. This method works on data from a lower-precision MS instrument, the type routinely used by and available to the community at large today. We test our technique on a controlled (spike-in) but realistic (serum biomarker discovery) experiment which is therefore verifiable. We also develop a new method for helping to assess the difficulty of a given spike-in problem. Lastly, we show that the problem of class prediction, sometimes mistaken as a solution to biomarker discovery, is actually a much simpler problem.
RESULTS: Using precision-recall curves with experimentally extracted ground truth, we show that (1) our technique has good performance using seven replicates from each class, (2) performance degrades with decreasing number of replicates, (3) the signal that we are teasing out is not trivially available (i.e. the differences are not so large that the task is easy). Lastly, we easily obtain perfect classification results for data in which the problem of extracting differences does not produce absolutely perfect results. This emphasizes the different nature of the two problems and also their relative difficulties.
AVAILABILITY: Our data are publicly available as a benchmark for further studies of this nature at http://www.cs.toronto.edu/~jenn/LCMS
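To make the precision-recall evaluation mentioned in RESULTS concrete, the following is a minimal sketch of how such a curve can be computed for a ranked list of candidate differences when the spiked-in features are known. It is not the authors' code: the function name `precision_recall_curve`, the per-feature scores, and the toy labels are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple


def precision_recall_curve(
    scores: Sequence[float], is_true: Sequence[bool]
) -> List[Tuple[float, float]]:
    """Return (recall, precision) at every cutoff of the score-ranked list.

    scores  -- hypothetical per-feature difference scores (higher = more different)
    is_true -- ground-truth flags, True where a feature was actually spiked in
    """
    if len(scores) != len(is_true):
        raise ValueError("scores and is_true must have the same length")
    n_true = sum(is_true)
    if n_true == 0:
        raise ValueError("ground truth contains no true differences")

    # Rank features from the highest score to the lowest.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

    curve = []
    hits = 0
    for rank, idx in enumerate(order, start=1):
        if is_true[idx]:
            hits += 1
        # recall: fraction of true differences found so far
        # precision: fraction of the reported features that are true
        curve.append((hits / n_true, hits / rank))
    return curve


if __name__ == "__main__":
    # Toy data (assumed, not taken from the paper).
    toy_scores = [9.1, 7.4, 6.8, 3.2, 2.5, 1.1]
    toy_truth = [True, True, False, True, False, False]
    for recall, precision in precision_recall_curve(toy_scores, toy_truth):
        print(f"recall={recall:.2f} precision={precision:.2f}")
```

A curve that stays near precision 1.0 until recall is high indicates that the true differences are ranked ahead of the false ones. Repeating the computation with fewer replicates per class is one way to observe the degradation the abstract describes.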
Related resources
Accurate inclusion mass screening: a bridge from unbiased discovery to targeted assay development for biomarker verification.
Verification of candidate biomarker proteins in blood is typically done using multiple reaction monitoring (MRM) of peptides by LC-MS/MS on triple quadrupole MS systems. MRM assay development for each protein requires significant time and cost, much of which is likely to be of little value if the candidate biomarker is below the detection limit in blood or a false positive in the original disco...
The Agilent HPLC-Chip/6210 TOF LC/MS Enables Highly Accurate Profiling of Peptide Maps for Differential Expression Studies
Liquid chromatography/mass spectrometry (LC/MS) based workflows, with their analytical power and potential throughput, play an important role in the discovery and validation of protein-based biomarkers. Current LC/MS workflows for biomarker discovery range from the classical shotgun proteomics approach to protein profiling strategies. However, no single LC/MS workflow has been adopted by the sc...
A neural network approach to multi-biomarker panel discovery by high-throughput plasma proteomics profiling of breast cancer
BACKGROUND In the past several years, there has been increasing interest and enthusiasm in molecular biomarkers as tools for early detection of cancer. Liquid chromatography tandem mass spectrometry (LC/MS/MS) based plasma proteomics profiling technique is a promising technology platform to study candidate protein biomarkers for early detection of cancer. Factors such as inherent variability, p...
Shotgun Proteomics and Biomarker Discovery
Coupling large-scale sequencing projects with the amino acid sequence information that can be gleaned from tandem mass spectrometry (MS/MS) has made it much easier to analyze complex mixtures of proteins. The limits of this "shotgun" approach, in which the protein mixture is proteolytically digested before separation, can be further expanded by separating the resulting mixture of peptides prior...
Informatics platform for global proteomic profiling and biomarker discovery using liquid chromatography-tandem mass spectrometry.
We have developed an integrated suite of algorithms, statistical methods, and computer applications to support large-scale LC-MS-based gel-free shotgun profiling of complex protein mixtures using basic experimental procedures. The programs automatically detect and quantify large numbers of peptide peaks in feature-rich ion mass chromatograms, compensate for spurious fluctuations in peptide sign...
Journal:
- Bioinformatics
Volume 23, Issue 2
Pages: -
Publication date: 2007